Picture for Sean Welleck

Sean Welleck

Verus-SpecGym: An Agentic Environment for Evaluating Specification Autoformalization

Add code
May 26, 2026
Viaarxiv icon

On the limits and opportunities of AI reviewers: Reviewing the reviews of Nature-family papers with 45 expert scientists

Add code
May 20, 2026
Viaarxiv icon

Reinforcing Human Behavior Simulation via Verbal Feedback

Add code
May 19, 2026
Viaarxiv icon

Gym-Anything: Turn any Software into an Agent Environment

Add code
Apr 07, 2026
Viaarxiv icon

Making Written Theorems Explorable by Grounding Them in Formal Representations

Add code
Apr 03, 2026
Viaarxiv icon

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

Add code
Mar 19, 2026
Viaarxiv icon

Argument Reconstruction as Supervision for Critical Thinking in LLMs

Add code
Mar 18, 2026
Viaarxiv icon

Mind the Sim2Real Gap in User Simulation for Agentic Tasks

Add code
Mar 11, 2026
Viaarxiv icon

GradAlign: Gradient-Aligned Data Selection for LLM Reinforcement Learning

Add code
Feb 25, 2026
Viaarxiv icon

Reasoning with Latent Tokens in Diffusion Language Models

Add code
Feb 03, 2026
Viaarxiv icon